In the past few months, I have been watching a very unsettling trend unfold in very competitive ecommerce search results on Google. It appears that huge amounts of money are put into place to systematically and successfully manipulate highly competitive search terms in order to sell fake merchandise of almost every bigger brand out there. Some of these sites even solely exist to steal people’s money and don’t ship anything at all.
Rest assured that I am not talking about some people doing good linkbuilding or about people buying a lot of links. These operations I talk about are much, much bigger and in all cases almost certainly run by criminal organizations of some sort. They not only greatly affect US search results but are also very present in at least UK, France and Germany.
In this article, I will show you several examples of where Googleβs search is absolutely broken (and by broken, I mean that 10 out of 10 page one search results are entirely fraud). I will also show you exactly how these rankings are achieved and take a look at what the impact on consumers may very possibly be. Last but not least, Iβll try to help you recognize these kinds of websites so you can avoid them as they become increasingly difficult to identify.
Exhibit A (βnfl jerseysβ)
Letβs get started with [nfl jerseys] as our first keyword to be examined. If you take a look at the US search results on google.com with personal search disabled (add &pws=0 to any search URL), you will get a list of websites which claim to sell said wear and merchandise at a significantly discounted price. Such huge discounts can be found on pretty much any of these fake shops, many ranging up to 75% in βsavingsβ.
Hereβs the search engine results page as of 01/04/2011:
(Note: Iβm not trying to βoutβ any particular site, so I removed any domain names in question from the screenshots)
As you can easily see, all of these sites feature ridiculous keyword stuffing in their root page titles as well as the term βjerseyβ within their domain name. This is both very common among them. Result #5 even contains Chinese letters.
As of this writing, the entire first results page is composed of fraudulent websites. In other words, Googleβs organic results have become entirely useless for this search phrase.
Exhibit B (βpandora jewelryβ)
Next, [pandora jewelry], also a very popular and well-respected brand. Positions 3, 5, 7, 8, 9 and 10 are fraud, which results in a 60% share of useless results.
Exhibit C (βthomas saboβ)
Looking at [thomas sabo] SERPs, they feel like a dΓ©jΓ -vu. Another jewelry brand, another wave of artificially boosted shops shipping either replica ware or just nothing at all: results 3, 4, 5, 6, 8, 9 and 10 should not be listed there at all in the first place (70%).
All of these sites try to appear as legit and official as possible.
See for yourself
Before diving into the details, I urge you to take a look at Googleβs results for these queries yourself. Try searching for other brands, too. Almost every popular brand is affected by this growing issue.
How they do it
Now that Iβve shown you how seriously broken Google is, letβs take a look at why Google is ranking these sites so well. Since these are no legit shops after all, itβs obvious that there is no kind of βbranding bonusβ or boost through actual social media activity at hand. Thereβs only one thing that leads to these rankings. Youβve guessed it: keyworded anchor-text heavy links.
The interesting question is though: where do these sites get their (anchor-text rich) links?
I have taken a look at many of these sites and found out that their link profiles are basically comprised of two kinds of links: automated forum and blog spam along with some hacked websites.
Letβs take a look at the anchor text variation of result #1 for βnfl jerseysβ:
This site also has a page authority of 64 and a domain authority of 57, according to Open Site Explorer.
Google’s best guess for “pandora jewelry” looks similar:
And the #1 “thomas sabo” result:
You might be surprised to see plain-old forum spam work this well, but let me get one thing straight: itβs not like Google is not penalizing or de-indexing any of these sites. I see them come and go on a daily basis (although some actually seem to stick for weeks or even months).
However, these people (or rather organizations) push such huge amounts of these sites into the web that Google – obviously – is having quite a hard time catching up.
In some way, and this is my personal opinion, this might be related to the Caffeine update – Google is now crawling and ranking sites a lot faster than ever before, but it appears overall search quality has suffered dramatically in the past 6 months or so.
Furthermore, link placements on hacked websites are very difficult to spot algorithmically. Granted, many of those links are not visible to the human eye and that should raise some flags since Google is capable of rendering any page, but overall itβs not comparable to catching automated posts on tens of thousands of web forums.
What really should have set Google’s alarm off, though, are the link growth patterns. Let’s take a look at the “nfl jerseys” top 3:
Two of these sites started spamming back in April 2010 and are still ranking in January 2011. Go figure.
Same goes for “pandora jewelry” and “thomas sabo”:
You get the picture.
What Google needs to do about it
Rand talked about it already, and his advice is instantly applicable to this issue: Google needs to greatly lower the value of keyword-rich anchor texts.
Think about it: if Google had not at all taken anchor text into account for these sites, none of them would probably rank anywhere near the top 10 results. Their links come from very different sources, and almost none of those sources is even remotely related to what their pretending to be selling.
As long as anchor text links outrank links from actually related websites, this is not going away anytime soon. Same goes for exact match keyword domains, by the way.
I do realize that anchor text is very important, but its abuse has reached a point where itβs no longer a ranking signal to be trusted as much as it currently is. Heck, I’ve actually seen websites rank #3 for these terms with one single sentence on the page: “seized by Department of Homeland Security“.
What it means for SEO
Google has a serious problem, and Iβm sure that they have been working on it relentlessly for quite some time now.
What it means for SEO is that whatever is working for the sites mentioned in this article – it will probably stop working soon. I would not be surprised to see Google shift even more ranking signal power from anchor-text heavy links to relevant social media βchatterβ. I have a feeling that itβs gaining more traction as we speak.
Of course, tweets and status updates can be spammed, bought and faked, too. But at least it will buy Google some time.
This fight is never over nor ever “won” by anyone. Ever.
How to identify these sites as a consumer
Since I donβt want any of you to order from these guys and receive either fake goods or nothing at all, hereβs some advice to identify them:
Most of these sites:
- offer unrealistic discounts (>=50% are pretty much everywhere)
- have no actual postal address
- supply only a contact form or
- supply only a GMail/Hotmail email address to contact them
- feature way too many βtrusted logosβ in their footer
- are written in poor English
Considering that most of the sites I talked about earlier already ranked well while all the holiday shopping took place, I can only imagine the damage done to thousands of families and individuals.
Please be cautious and remember that if a deal sounds too good to be true, it very probably is.
Since this is my first article for SEOmoz, please let me know in the comments if you liked this article and give me a βthumbs upβ if you did. In case youβve even been affected by this kind of fraud personally, Iβd love to hear from you, too.
– Rouven Balci, SEO at Toms Gutscheine